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SPEKKENS’S SYMMETRIC NO-GO THEOREM 


ANDREAS BLASS AND YURI GUREVICH 


In [6], Spekkens clarifies the ways in which classical theories dif¬ 
fer from quantnm mechanics. He improves the traditional notions of 
non-negativity of Wigner-style qnasi-probability distributions and non- 
contextuality of observations. He argues that the improvements more 
accurately capture what a classical universe would look like. Thus, 
both of these improved notions serve to distinguish quantum theory 
from classical theories, in particular from theories that use hidden vari¬ 
ables in an attempt to explain the results of quantum mechanics on a 
classical basis. Spekkens then shows that the two improved notions are 
equivalent to each other. 

Spekkens’s improvements of non-negativity and non-contextuality 
emphasize the involvement of both preparations and measurements. In 
the second part of [6], Spekkens provides what he calls an even-handed 
approach to a no-go theorem. The theorem asserts that the require¬ 
ment of non-contextuality (or equivalently of non-negativity) prevents 
a theory from matching the predictions of quantum mechanics; in other 
words, non-contextual hidden-variable theories can’t succeed. “Even- 
handed” means that the proof treats preparations and measurements 
in a symmetrical way. 

The paper [6] contains some minor inaccuracies and one false claim, 
in the proof of the no-go theorem. The false claim is “that a function 
/ that is convex-linear on a convex set S of operators that span the 
space of Hermitian operators (and that takes value zero on the zero 
operator if the latter is in S) can be uniquely extended to a linear 
function on this space.” Unfortunately, this claim, early in the proof, 
is used in an essential way in the rest of the argument. In this note, 
we analyze carefully Spekkens’s proof of the no-go theorem, explain 
the inaccuracies, reduce the task of proving the no-go theorem to the 
special case of a single qubit and then prove the special case. This 
gives us a complete proof of Spekkens’s no-go theorem. An alternative 
proof of the no-go theorem is given in the series of papers dials]. 
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1. Definitions 

Spekkens defines a quasiprobability representation of a quantum sys¬ 
tem by the following features. 

QPRl Every density operator p is represented by a normalized and 
real-valued function pp on a measurable space A. 

QPR2 Every positive operator-valued measure (POVM) {E^} is repre¬ 
sented by a set of real-valued functions on A that sum to 

the unit function on A. (The trivial POVM {/} is represented 
by ^ 7 (A) = 1, and the zero operator is represented by the zero 
function.) 

QPR3 For all density operators p and all POVM elements we have 

Tr(pEfc) = /dA/Xp(A)^ij,(A). 

A quasiprobability representation is called nonnegative if all the func¬ 
tions Pp and take only nonnegative values. 

We begin our analysis by looking carefully at the notions used in 
this definition of quasiprobability representation and clarifying some 
aspects of the definition. 

1.1. Density operator. Within the definition of quasiprobability rep¬ 
resentation, Spekkens explains “density operator” as “a positive trace- 
class operator on a Hilbert space "H”. Although “trace-class” implies 
that the operator p has a well-defined trace, Spekkens presumably in¬ 
tended more, namely that the trace Tr(p) should be equal to 1. This 
would conform with the usual meaning of “density operator”; it would 
also account for the requirement that Pp be normalized. If one could 
multiply p by a positive real factor and still have a density operator, 
then the associated pp should also be multiplied by the same factor. 
From now on, we shall assume that “trace 1” is included in the defini¬ 
tion of density operator. 

It is also worth remembering that “trace-class” is important only 
in the context of infinite-dimensional Hilbert spaces. If "H is finite¬ 
dimensional, then all (linear) operators on it are in the trace class. 
Spekkens’s proof of the no-go theorem does not require an infinite¬ 
dimensional space; it works as long as the dimension of "H is at least 
2. So for many purposes, we need not worry about the “trace-class” 
clause in the definition of density operators. 

1.2. Measurable space. The phrase “measurable space” is standard 
terminology for a set X together with a a-algebra S of subsets of V, 
the members of S being called the measurable sets. A measurable 
space differs from a measure space in that the latter has, in addition to 
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X and S, a countably additive measure defined on all the measurable 
sets. 

We believe that Spekkens intends A to be not merely a measurable 
space but a measure space. He uses the formula J fip{X) dA = 1 as 
the definition of the requirement in QPRl that fip be normalized. This 
integral and the one in clause QPR3 of the definition of quasiprobability 
representation both presuppose the presence of a measure to make sense 
of dX. They also presuppose that the functions /Xp are measurable. 

An alternative modification to make sense of these integrals would 
be to change the requirement that p is represented by a function and 
to require instead that it be represented by a measure, say z/p. The 
notation /ip(A) dX could then be taken to be syntactic sugar for dvp{X). 
This alternative approach has, as far as we can see, two disadvantages 
and two advantages. The first disadvantage is that it requires us to 
understand Spekkens’s notation /ip(A) dX, which looks like a standard 
notation, as syntactic sugar for something rather different. The second 
is that it explicitly contradicts Spekkens’ assertion that /ip should be a 
function. The first advantage is that it preserves Spekkens’s convention 
that A is merely a measurable space, not a measure space. The second 
advantage is that it is more general. In the approach with an a priori 
given measure dX, multiplying it by functions Pp produces only those 
measures /ip(A) dX that are absolutely continuous with respect to dX. 
The alternative approach allows arbitrary measures (on the given a- 
algebra S) without any requirement of absolute continuity. 

1.3. Positive operator-valued measures. The second defining fea¬ 
ture, QPR2, of a quasiprobability representation represents positive 
operator valued measure£| {Ek\ by sets of functions The elements 
Ek of a POVM are positive Hermitian operators such that I — E^ is 
also positive. That is, the spectrum of E^ lies in the interval [0,1] of 
the real line. Conversely, any such operator occurs as a member of 
some POVM, and usually as a member of many POVMs. Specifically, 
if i? is a positive Hermitian operator and I — E is also positive, then 
{E, I — E} is a POVM; unless E = I, we can replace I — E in this 
POVM by two or more positive operators whose sum is I — E, thereby 
obtaining other POVMs containing E. 


^We follow Spekkens’s usage of “POVM” to refer to a discrete set of operators. 
This usage agrees with the standard text [4]. There is a generalization, involving 
operator-valued measures; see for example [S]. For our purposes, the simpler version 
is adequate, since the no-go theorem for these simpler POVMs implies the theorem 
for the broader class. 
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The question arises whether the function ^Ek ^ quasiprobability 
representation can depend on the POVM from which was taken or 
must depend only on the operator itself. The wording of the deh- 
nition suggests the former, while the notation ^e^ suggests the latter. 
Fortunately for our purposes, Spekkens’s dehnition of “measurement 
noncontextuality” requires that “depends only on the associated 
POVM element Ek” (italics added). Since our goal in this paper, the 
no-go theorem, is about noncontextual representations, we can safely 
follow the notation and assume that ^e depends only on E, not on 
the POVM in which it occurs (and, a fortiori, not on the measurement 
process by which that POVM is realized). 

2. An additional hypothesis 

At the beginning of his proof of the no-go theorem, Spekkens notes 
that a mixture p = density operators pj with weights Wj 

can be prepared by hrst randomly choosing one value of j from the 
probability distribution {tCj} and then preparing pj. He infers that 
“clearly” pp{X) = Ej(A)- 

Although this inference is highly plausible and natural on physical 
grounds, it does not follow from just the dehnition of quasiprobability 
distribution as quoted above. Suppose that the functions ^e do not 
span the whole space of square-integrable functions on A, so that there 
is a function a orthogonal to all of these where “orthogonal” 

means that J (t(A)^e(A) dX = 0. One could modify the Pp functions by 
adding to each one some multiple of a, obtaining p'p = pp + Cpa and 
still satisfying the dehnition of quasiprobability representation. Here 
the coefficients Cp can be chosen arbitrarily for each density operator p. 
By choosing them in a sufficiently incoherent way, one could arrange 
that p'p{X) ^ Y.jWjp'p.{X). 

If, on the other hand, the do span the whole space of functions 
on A, then Spekkens’s desired equation Pp{X) = '^jdpjW does 
follow, for all but a measure-zero set of A’s, because the two sides 
of the equation must give the same result when integrated against any 

^E- 

Unfortunately, nothing in the dehnition of quasiprobability represen¬ 
tations requires the to span the whole space. For example, given 
any quasiprobability representation, we can obtain another, physically 
equivalent one as follows. Replace A by the disjoint union Ai U A 2 of 
two copies of A. Dehne the measure of any subset of Ai U A 2 to be 
the average of the original measures of its intersections with the two 
copies of A. Dehne all the functions Pp and ^e on the new space by 
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simply copying the original values on both of the Aj’s. The result is a 
quasiprobability representation in which the ^£;’s span only the space 
of functions that are the same on the two copies of A. 

The result of this discussion is that, in order to prove the no-go the¬ 
orem along the lines proposed by Spekkens, we must add an additional 
hypothesis about mixtures of densities. There is a similar assumption 
for mixtures of measurements. 

Convex-linearity Hypothesis: Let {tCj} be a probability distribu¬ 
tion on a set of indices j. 

• If P = Ej Wjpj, then pp{X) = Wjfipj{X). 

• li E = then ^£;(A) = Y,jWj^E^{X). 

This hypothesis is exactly statements (7) and (8) in [6]. The name of 
the hypothesis refers to the following terminology, which we shall need 
again later. 

Definition 1. Let C be a convex set in a real vector space V, and let 
/ be a function from C into another real vector space W. Then / is 
convex-linear on a subset S' of C if 

/(aiui H-h OnVn) = aif{vi) H-h a„/(u„) 

for all vectors Ui,..., G S and all nonnegative numbers Oi,..., a„ 
with oi = 1. 

Thus, the convex-linearity hypothesis says that the functions Pp 
and E are convex-linear on the sets of density matrices and 

POVM elements, respectively. 

3. The no-go theorem 

On an intuitive level, the no-^o theorem asserts that nonnegative 
quasiprobability representation^ subject to the convex-linearity hy¬ 
pothesis cannot reproduce the predictions of quantum mechanics. A 
considerable amount of agreement with quantum mechanics is already 
built into the dehnition of quasiprobability representations. Specih- 
cally, the equation Ti^pEk) = f Pp{X)^EkW dX says that the expecta¬ 
tion of Ek in state p is the same whether computed by the quantum 
formula Tr(pA'fc) or as an average using the functions Pp and ^e^ from 
the quasiprobability representation. Spekkens’s no-go theorem asserts 
that there is no nonnegative quasiprobability representation satisfying 
convex-linearity. 

^Tliese are equivalent to noncontextual ontological models, as Spekkens shows 
in the earlier sections of [5]. 
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A small technical point is that the no-go theorem presnpposes that 
the quantum mechanics is non-trivial. Quantum mechanics on Hilbert 
spaces of dimensions 0 or 1 is classical (and trivial), so we must as¬ 
sume that we are dealing with a Hilbert space "H of dimension at least 
2. An inspection of Spekkens’s argument reveals that he never uses 
any stronger assumptions about "H. Thus, the no-go theorem can be 
formally stated as follows. 

Theorem 2. For a Hilbert space FL of dimension at least two, there 
is no way to define nonnegative fXp, for all density operators p, and 
to define nonnegative ^e, for all positive Hermitian operators E with 
I — E positive, so as to satisfy both the definition of a quasiprobability 
representation and the convex-linearity hypothesis. 

4. Reduction to two dimensions 

In this section, we reduce the task of proving Spekkens’s no-go theo¬ 
rem to the special case where FL has dimension 2. (In the terminology 
of quantum computing, FL represents a single qubit.) More generally, 
we show that, if there were a nonnegative quasiprobability representa¬ 
tion satisfying convex-linearity for some Hilbert space FL, then there 
would also be such a representation, using the same measure space A, 
for any nonzero, closed subspace FL' of FL. 

To see this, suppose functions Pp (for all p) and (for all E) consti¬ 
tute such a representation for FL. Let i : FL' ^ FL he the inclusion map 
(the identity map of FL' regarded as a map into FL), and let p : FL ^ FL' 
be the orthogonal projection map (sending each vector in FL' to itself 
and sending each vector orthogonal to FL' to 0). Also, £x some unit 
vector I a) G FL'. 

Each density operator p on FL' gives rise to a density operator p = 
io pop onFL. For pure states, this amounts to just considering a state 
vector in FL' as a vector in the larger Hilbert space FL. For mixed states, 
the extension preserves averages. We begin defining a quasiprobability 
representation for FL' by setting p'^ = pp. We note that this is a nor¬ 
malized nonnegative real-valued function on A, and that it satisfies the 
part of convex-linearity that refers to the representations of densities. 

It is tempting to proceed exactly analogously with POVM elements 
E and their representing functions ^e- That procedure doesn’t quite 
work, because the definition of quasiprobability representation imposes 
a specific requirement on ^/, where I is the identity operator. Unfor¬ 
tunately, if / is the identity operator on FL', then z o J o p is not the 
identity operator on "H. So we must proceed slightly differently, and it 
is here that the fixed unit vector \a) will be useful. 
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Given a POVM element E on "H', i.e., a positive, Hermitian operator 
such that I — E is also positive, we dehne E to be the unique linear 
operator on "H such that 

I (a I i? I a) I if |V’)-L'H'. 

In other words, E agrees with E on l-L' and with a scalar multiple 
of the identity on the orthogonal complement of "H', the multiplier of 
the identity being {a\E\a). This extension process produces POVM 
elements for "H; indeed, if a set {Ek} of operators is a POVM for H', 
then {Ek} is a POVM for H. Furthermore, the extension process sends 
the identity and zero operators on "H' to the identity and zero operators 
on y,, and the process respects weighted averages. 

We continue the dehnition of a quasiprobability representation for 
y by setting for all POVM elements E on "H'. The remarks 

above immediately imply that these functions are as required by the 
second part, QPR2, of the dehnition of quasiprobability representation, 
that they are nonnegative, and that they satisfy the relevant part of 
the convex-linearity hypothesis. 

To verify the last part, QPR3, of the dehnition of quasiprobability 
representation, we observe that, for any density operator p and POVM 
element E on "H', the extensions p and E agree with p and E on 
y, while on the orthogonal complement of y, p acts as zero and E 
acts as a scalar multiple of the identity. It follows immediately that 
y[pE) = y{pE), and therefore 

Tr(pi5) = J 

as required. 

This completes the proof that nonnegative quasiprobability repre¬ 
sentations subject to convex-linearity can be “restricted” to nonzero, 
closed subspaces of the original Hilbert space. Therefore, it suffices to 
prove the no-go theorem in the special case where l-L has dimension 2. 

Remark 3. By concentrating on the case of dimension 2, we gain two 
advantages. First, we can avoid some technicalities that would arise for 
inhnite-dimensional Hilbert spaces. Second, we obtain a more concrete 
picture of the relevant spaces of density operators and measurements. 
(The hrst of these advantages would result from reduction to any hnite 
number of dimensions; the second benehts specihcally from dimension 
2 .) 
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5. Convex-linear transformations 

Spekkens asserts that, if a function / is convex-linear on a convex set 
S of operators that span the space of Hermitian operators (and / takes 
the value zero on the zero operator if the latter is in 5), then / can be 
uniquely extended to a linear function on this space. Unfortnnately, 
such a linear extension need not exist in the general case, when zero is 
not in sE For a simple example, consider the function that is identically 
1 on an that spans the space of Hermitian operators, does not contain 
0, but does contain two orthogonal projections and their snm. 

The correct version of the result extends / not to a linear fnnction 
bnt to translated-linear fnnction, i.e., a composition of translations 
and a linear fnnction. The rest of this section is devoted to a proof 
of this fact, in somewhat greater generality than we need. It applies 
to arbitrary real vector spaces; that the space consists of Hermitian 
operators is irrelevant. 

The convex hull, Conv(S'), of a snbset S' of a real vector space 
V consists of the convex combinations aiVi of vectors 

Vi,... ,Vn e S where Oi = 1 and every a* > 0. The affine 

hull, Aff(S), of S consists of the affine combinations aiVi 
of vectors ui,..., G S where oi a„ = 1 but some coefficients 

Ui may be negative. 

A set is convex if it contains all the convex combinations of its mem¬ 
bers; similarly, it is an affine space if it contains all the affine combi¬ 
nations of its members. An easy compntation shows that convex hulls 
are convex and affine hnlls are affine spaces; that is Conv(Conv(S')) = 
Conv(5) and Aff(Aff(^)) = Aff(F). 

An affine space A in a vector space V is said to be parallel to a linear 
subspace L oiV ii A = uo + L = {mq + n : n G L} for some Uq G V. It 
is easy to see that, if an affine space A is parallel to a linear space L as 
above, then (i) L is nniqne, (ii) uq G A, (iii) any vector in A can play 
the role of the translator uo, and (iv) A is either eqnal to L or disjoint 
from L. 


^Spekkens gives a formula purporting to define a linear extension of / in general, 
but it is not well-defined because it involves some arbitrary choices. He also gives, 
in footnote 18 of the newer version [7] of his paper, an argument purporting to 
show that his formula is independent of those choices, but that argument fails. 
It involves dividing by an appropriate constant C to turn two nonnegative linear 
combinations, the two sides of an equation, into convex combinations so that the 
assumption of convex-linearity can be applied. But the necessary divisor C may 
need to be different for the two sides of the equation. 
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Lemma 4 (§1 in [5]). Any affine subspace A of a real vector space V 
is parallel to a linear subspace L ofV. 

In other words, any affine subspace is a translation of a linear sub¬ 
space. For example, in R^, we have that Aff{(0,1), (1, 0)} is parallel 
to the diagonal y = —x, and Aff{(0,1), (1, 0), (1,1)} is (and thus is 
parallel to) R^. 

Proof. If A contains the zero vector 0 then it is a linear subspace. 
Indeed, if t G A then any multiple av = an -|- (1 — a)0 G A. And if 
u,v ^ A then u + v = 2{^u + ^v) G A. 

For the general case, let Uq be any vector in the affine space A. It 
suffices to show that L = {n — Mq : t G A} is an affine space, because 
then the preceding paragraph shows that it is a linear space, and clearly 
A = Uq + L. Any affine combination ai(Ti — Uq) — Uq) 

of vectors in L (so the Vi are in A and the sum of the a* is 1) can be 
rewritten as {aiV\ a„T„) — mq, which is in L. □ 

Let V and W be real vector spaces, S a subset oiV, C = Conv(S') its 
convex hull, and A = Aff(S') its affine hull. Recall that a transformation 
/ : C —)■ VF is convex-linear on S if 

f{aiVi H-h anVn) = aif{vi) ^ -h anf{Vn) 

for any convex combination aiVi OnUn of vectors Vi from S. 

A transformation / : A —)■ VF is translated-linear if it has the form 
f{v) = Wo + h{v — Uq) for some Wq G VF, some Uq G A, and some linear 
function h \ L dehned on the linear space L = A — uq parallel to 

A. 

Proposition 5. With notation as above, any transformation f : C ^ 
W that is convex-linear on S has a unique extension to a translated- 
linear function on A. 

Proof. Notice hrst that translations v ^ v — uq and linear functions 
both preserve affine combinations. A translated-linear function, be¬ 
ing the composition of two translations and a linear function, there¬ 
fore also preserves affine combinations. This observation implies the 
uniqueness part of the proposition. Indeed, every element of A is an 
affine combination oiSi of elements of S, and therefore any 

translated-linear extension of / must map it to ai/(si) H— ■ + anf{sn)- 
To prove the existence part of the proposition, it will be useful to 
work with the graphs of functions. For any function g : S ^ W with 
S <PV, its graph is the subset of R© VF consisting of the pairs (s, g{s)) 
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for s G S'H We record for future reference that the graph of 5 ^ is a linear 
subspace of Id © hh if and only if the domain of (7 is a linear subspace 
of V and is a linear transformation from that domain to W. We 
also note that the projection n : V (B W V : (v, w) 1 —^ v is a linear 
transformation that sends the graph of any g to the domain of g. 

In the situation of the proposition, let / : (7 —)■ W be a transfor¬ 
mation that is convex-linear on S, and let F C 1/ © W be its graph. 
Also, let F~ be the graph of the restriction of / to S. Notice that the 
convex-linearity of / on S' means exactly that F is the convex hull of 
F~. It follows that F and F~ have the same affine hull, because 

Aff(F) = Aff(Conv(F-)) C Aff(Aff(F-)) = Aff(F-) C Aff(F). 

We claim that this affine hull Aff(F“) is the graph of a function; that 
is, it does not contain two distinct elements (v,w) and {v,w') with the 
same first component v. To see this, suppose we had two such elements 
in Aff(F) = Aff(F“), say 

(v, w) = ai(si, /(si)) H-h am{sm, f{sm)) 

and 

[V, w') = biitijiti)) H-h bn{tn, f{tn)), 

where all the sds and t/s are in S and where 

( 1 ) ai + ■ ■ ■ + ttra = bi + ■ ■ ■ + bn, 

because both sides are equal to 1. So we have 

(2) OiSi + ■ • • + CLmSm = ‘ 

because both sides are equal to v, and we want to prove w = w', i.e., 

(3) Ol/(Sl) + • • • + CtmfiSm) = bif{ti) + • • ■ + bnf{tn)- 

In the special case where all coefficients a* and bj are > 0 , vector v is in 
C and both sides of ([3]) are equal to f{v). The general case reduces to 
this special case as follows. In all three equations ([I])-®, move every 
summand with a negative coefficient to the other side, and then divide 
the resulting equations by the left part of the rearranged equation ([T]). 
As a result we return to the special case already treated. Since the old 
version of (jS)) follows from the new one, this completes the proof of our 
claim that Aff(F) = Aff(F“) is the graph of a function. 

By LemmalU the affine space Aff(F) is parallel to a linear subspace H 
of Id© IT, say Aff(F) = {uq, wo) + H, where uq eV and wq G W. From 
the fact that Aff(F) is the graph of a function, it follows immediately 
that H is also the graph of a function. Indeed, if H contains (u, w) and 

^In set-theoretic foundations, a function is usually defined as a set of ordered 
pairs, and so g is the same thing as its graph. 
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(t, ty'), then Aff(F) contains {v — uo,w — wq) and (v — uq, w' — zco), so 
w — Wq = w' — Wq and w = w'. 

Let h be the function whose graph is H. Because H is a. linear 
subspace of IL 0 W, we know that h is a linear transformation from 
some linear subspace L of V into W. 

The fact that {uo,wo) + H = Aff(F) tells us, by applying the linear 
projection n -.V ®W ^ V, that uq + L equals 

7 r(Aff(F)) = Aff( 7 r(F)) = Aff(C') = A, 

where the hrst equality comes from linearity of vr and the second from 
the fact that F is the graph of the function / whose domain is C. So A 
is parallel to the linear subspace L of V. Furthermore, for each t G (7, 
we have 

(v, f(v)) E F C Aff(F) = (uo, Wo) + H, 

so (v — Uq, f{v) — Wo) is in the graph FI of h. That is, h{v — uq) = 
f{v) — Wo and so f{v) = tco + h{v — uo). Thus, the translated-linear 
function t icq + h{v — tto) is the desired extension of /. □ 

Remark 6 . A linear function h on a subspace L of a vector space V 
can be extended to a linear function h on all of V. Extend any basis 
of L to a basis of V, dehne h arbitrarily on the new basis vectors that 
are not in L, and extend the resulting function by linearity to all of V. 

For transformations dehned on all of V, we have a simpler formula 
for translated-linear functions, because 

Wo + h{v - Uo) = wo + h{v) - h{uo) = h{v) + wi, 
where wi = wo — h{uo). 

On the other hand, in contrast to Proposition [5l this h is not unique 
(unless L = V). 

Also, in the case of inhnite-dimensional spaces, the extension process 
requires the axiom of choice (to extend bases) and need not be well- 
behaved with respect to natural topologies on the vector spaces. 


6 . Density operators and POVM elements in two 

DIMENSIONS 


In this section, we recall the form of density operators and POVM 
elements in the case where Ri is two-dimensional. In this case, a basis 
for the Hermitian operators on Ri is given by the identity and the three 
Pauli matrices 


I = 



X 
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It will be convenient to use vector notation, denoting the triple of 
matrices {X, Y, Z) hj X. Then the general Hermitian matrix looks like 

wl + xX + yY + zZ = wl + X ■ X, 

where w and the three components of x are real numbers. The eigen¬ 
values of this Hermitian matrix are 

w ± \/x"^ + = w ± ||T|| 

In particular, the trace of this matrix is 2w, and the matrix is positive 
if and only if tc > ||T||. 

Density matrices are the Hermitian, positive matrices of trace 1, so 
they have the form 

p = p{x) = ^{I + x-X), 

where ||x|| < 1 . As indicated by the notation, we parametrize these 
density matrices by three-component vectors x of norm < 1. The three- 
dimensional ball that serves as the parameter space here is called the 
Bloch sphere (with its interior). 

Similarly, POVM elements have the form 

E = E{m, p) = ml + pX -|- qY + rZ = ml + p ■ X 

with 

11^1 < m < 1 - 11^1 

(because E and I — E are positive operators) and therefore ||p|| < 
The parameter space here, consisting of all four-component vectors 
satisfying these inequalities, is a double cone over a three-dimensional 
ball of radius 

We record for future reference the traces 

Tr(/) = 2, Tr(X) = Tr(y) = Tr(Z) = 0 
and the multiplication table 

XY = -YX = iZ, YZ = -ZY = iX, ZX = -XZ = iY, 

and 

= Y^ = Z^ = I. 

From these facts, it is easy to compute that 

TT{p{x)E{m,p)) = m + X ■ p, 

where the factor ^ in the dehnition of p(x) has cancelled the factor 2 
arising from Tr(/). 
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7. Quasiprobability representation 


Finally, we are ready to prove Theorem O 

Suppose, toward a contradiction, that we have a nonnegative quasiprob¬ 
ability representation satisfying convex-linearity, for a two-dimensional 
'H. In view of Proposition |5l we know that 

/^p(x)(A) = X ■ A(A) -|- C'(A) 

and 

iE(m,p) = P- B{X) + mD{X) + F{X) 

for some nine functions A(A), Bi{X), C(A), -D(A), F{X) where the index 
i ranges from 1 to 3. (The “translated” part of “translated-linear” 
accounts for C and F.) 

The definition of quasiprobability representation leads to some sim¬ 
plifications. i7(0,0) is the zero operator, whose associated ^ function 
is required to be identically zero. That gives us F{X) = 0 for all A, so 
we can simply omit F from the formula for Q 

Also, i?(l,0) is the identity operator, whose associated ^ function is 
required to be identically 1. That gives us iA(A) = 1 for all A. So we 
can simplify the ^ formula above to read 

^Eim,p) = P- B{X) m. 


Next, consider the requirement that 


TT{p{x)E{m,p)) 



We already evaluated the trace on the left side of this equation at the 
end of the preceding section. The integral on the right side is 


j[{p- B{X))ix- A{X)) + {p-B{X))CiX)+m{x-A{X))+mC{X)]dX. 


Comparing the trace and the integral, and equating coefficients of the 
various monomials in m, p, and x, we find that 


( 4 ) 

j 5,(A)A,(A) dX 

( 5 ) 

j Bi{X)C{X) dX 

( 6 ) 

j A{X) dX 

( 7 ) 

J C'(A) dX 


< 5 * 


j’ 


0 , 

0, and 

1. 
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Next, we extract as much information as we can from the assumption 
that all the functions /ip and are nonnegative. 

In the case of this means that, as long as ||^| < m, 1 — m (so 
that E{m,p) is a POVM element), we must have m + p ■ B{X) > 0 for 
all A. Temporarily consider a fixed A and a fixed m G [0, ^]. To get 
the most information out of the inequality m + p- B{X) > 0, we choose 
the “worst” vector p, i.e., we make p- B{X) as negative as possible, 
by choosing p in the opposite direction to B{X) and with the largest 
permitted magnitude, namely m. That is, we take 

P = -J^5(A) 

\\B{X)\\ 

so that our inequality becomes 0 < m(l — ||i?(A)||), and therefore 

||5(A)||<1 for all A. 

Repeating the exercise for m G [|, 1] gives no new information. 

So we turn to the case of /ip(^), for which the nonnegativity require¬ 
ment reads 

f •A(A) + C'(A) > 0. 

For each fixed A, we consider the “worst” x, namely a vector x in the 
direction opposite to ^(A) and with the maximum allowed magnitude, 
namely 1. So we take 

^(A) 

P(A)|| 

and obtain the inequality 0 < —||y4(A|| -|- C(A). Thus, we have 
||A(A)|| < C'(A) for all A. 


In particular, C(A) is everywhere nonnegative. 

A trivial consequence of ||A(A)|| < C'(A) is that |Ai(A)| < C(A). 
Similarly, a trivial consequence of ||i?(A)|| < 1 is |i?i(A)| < 1. Putting 
this information into the i = j = 1 case of equation (jl]), and also using 
o. we find that 


1 


j Ri(A)Ai(A) dX 


< 


Ri(A)|-|Ai(A)|dA< 


1 •C'(A)dA = 1. 


So both of the inequalities here must be equalities. In particular, 
|i?i(A)| = 1 for almost all A except where C'(A) = 0. 

Similarly, we get that, for almost all A except where C'(A) = 0, we 
also have |i? 2 (A)| = |R 3 (A)| = 1 and therefore ||.B(A)|| = y^. Since 
we also know ||R(A)|| < 1, we must conclude that C'(A) = 0 almost 
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everywhere. But that contradicts equation ([71), and so the proof of the 
no-go theorem is complete. 
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